Weihua Luo

AI Business, Alibaba Group

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

May 08, 2025

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

May 05, 2025

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Apr 22, 2025

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Apr 14, 2025

A Unified Agentic Framework for Evaluating Conditional Image Generation

Apr 09, 2025

New Trends for Modern Machine Translation with Large Reasoning Models

Mar 13, 2025

Towards Widening The Distillation Bottleneck for Reasoning Models

Mar 03, 2025

Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models

Feb 19, 2025

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Feb 18, 2025

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy

Feb 17, 2025